Search CORE

Public Library of Science (PLOS)

The Relationship among Gene Expression, the Evolution of Gene Dosage, and the Rate of Protein Evolution

Author: Duret Laurent
Gout Jean-François
Kahn Daniel
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

The understanding of selective constraints affecting genes is a major issue in biology. It is well established that gene expression level is a major determinant of the rate of protein evolution, but the reasons for this relationship remain highly debated. Here we demonstrate that gene expression is also a major determinant of the evolution of gene dosage: the rate of gene losses after whole genome duplications in the Paramecium lineage is negatively correlated to the level of gene expression, and this relationship is not a byproduct of other factors known to affect the fate of gene duplicates. This indicates that changes in gene dosage are generally more deleterious for highly expressed genes. This rule also holds for other taxa: in yeast, we find a clear relationship between gene expression level and the fitness impact of reduction in gene dosage. To explain these observations, we propose a model based on the fact that the optimal expression level of a gene corresponds to a trade-off between the benefit and cost of its expression. This COSTEX model predicts that selective pressure against mutations changing gene expression level or affecting the encoded protein should on average be stronger in highly expressed genes and hence that both the frequency of gene loss and the rate of protein evolution should correlate negatively with gene expression. Thus, the COSTEX model provides a simple and common explanation for the general relationship observed between the level of gene expression and the different facets of gene evolution

Polymorphix: a sequence polymorphism database

Author: Bazin Eric
Duret Laurent
Galtier Nicolas
Penel Simon
Publication venue: Oxford University Press
Publication date: 17/12/2004
Field of study

Within-species sequence variation data are of special interest since they contain information about recent population/species history, and the molecular evolutionary forces currently in action in natural populations. These data, however, are presently dispersed within generalist databases, and are difficult to access. To solve this problem, we have developed Polymorphix, a database dedicated to sequence polymorphism. It contains within-species homologous sequence families built using EMBL/GenBank under suitable similarity and bibliographic criteria. Polymorphix is an ACNUC structured database allowing both simple and complex queries for population genomic studies. Alignments within families as well as phylogenetic trees can be download. When available, outgroups are included in the alignment. Polymorphix contains sequences from the nuclear, mitochondrial and chloroplastic genomes of every eukaryote species represented in EMBL. It can be accessed by a web interface (http://pbil.univ-lyon1.fr/polymorphix/query.php)

CiteSeerX

Springer - Publisher Connector

Identitag, a relational database for SAGE tag identification and interspecies comparison of SAGE libraries

Author: Damiola Francesca
Duret Laurent
Gandrillon Olivier
Keime Céline
Mouchiroud Dominique
Publication venue: BioMed Central
Publication date: 01/01/2004
Field of study

BACKGROUND: Serial Analysis of Gene Expression (SAGE) is a method of large-scale gene expression analysis that has the potential to generate the full list of mRNAs present within a cell population at a given time and their frequency. An essential step in SAGE library analysis is the unambiguous assignment of each 14 bp tag to the transcript from which it was derived. This process, called tag-to-gene mapping, represents a step that has to be improved in the analysis of SAGE libraries. Indeed, the existing web sites providing correspondence between tags and transcripts do not concern all species for which numerous EST and cDNA have already been sequenced. RESULTS: This is the reason why we designed and implemented a freely available tool called Identitag for tag identification that can be used in any species for which transcript sequences are available. Identitag is based on a relational database structure in order to allow rapid and easy storage and updating of data and, most importantly, in order to be able to precisely define identification parameters. This structure can be seen like three interconnected modules : the first one stores virtual tags extracted from a given list of transcript sequences, the second stores experimental tags observed in SAGE experiments, and the third allows the annotation of the transcript sequences used for virtual tag extraction. It therefore connects an observed tag to a virtual tag and to the sequence it comes from, and then to its functional annotation when available. Databases made from different species can be connected according to orthology relationship thus allowing the comparison of SAGE libraries between species. We successfully used Identitag to identify tags from our chicken SAGE libraries and for chicken to human SAGE tags interspecies comparison. Identitag sources are freely available on web site. CONCLUSIONS: Identitag is a flexible and powerful tool for tag identification in any single species and for interspecies comparison of SAGE libraries. It opens the way to comparative transcriptomic analysis, an emerging branch of biology

Springer - Publisher Connector

Natural history of the ERVWE1 endogenous retroviral locus

Author: Beliaeff Jean
Bonnaud Bertrand
Bouton Olivier
Duret Laurent
Mallet François
Oriol Guy
Publication venue: BioMed Central
Publication date: 01/01/2005
Field of study

BACKGROUND: The human HERV-W multicopy family includes a unique proviral locus, termed ERVWE1, whose full-length envelope ORF was preserved through evolution by the action of a selective pressure. The encoded Env protein (Syncytin) is involved in hominoid placental physiology. RESULTS: In order to infer the natural history of this domestication process, a comparative genomic analysis of the human 7q21.2 syntenic regions in eutherians was performed. In primates, this region was progressively colonized by LTR-elements, leading to two different evolutionary pathways in Cercopithecidae and Hominidae, a genetic drift versus a domestication, respectively. CONCLUSION: The preservation in Hominoids of a genomic structure consisting in the juxtaposition of a retrotransposon-derived MaLR LTR and the ERVWE1 provirus suggests a functional link between both elements

Hal-Diderot

LALNVIEW: a graphical viewer for pairwise sequence alignments

Author: Elisabeth Gasteiger
Guy Perrièe
Laurent Duret
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/1996
Field of study

arXiv.org e-Print Archive

Direct Text to Speech Translation System using Acoustic Units

Author: Duret Jarod
Gimeno Pablo
Khurana Sameer
Laurent Antoine
Mingote Victoria
Vicente Luis
Publication venue
Publication date: 14/09/2023
Field of study

This paper proposes a direct text to speech translation system using discrete acoustic units. This framework employs text in different source languages as input to generate speech in the target language without the need for text transcriptions in this language. Motivated by the success of acoustic units in previous works for direct speech to speech translation systems, we use the same pipeline to extract the acoustic units using a speech encoder combined with a clustering algorithm. Once units are obtained, an encoder-decoder architecture is trained to predict them. Then a vocoder generates speech from units. Our approach for direct text to speech translation was tested on the new CVSS corpus with two different text mBART models employed as initialisation. The systems presented report competitive performance for most of the language pairs evaluated. Besides, results show a remarkable improvement when initialising our proposed architecture with a model pre-trained with more languages.Comment: 5 pages, 4 figure

Mutation Patterns in the Human Genome: More Variable Than Expected

Author: Arndt
Baer
Bird
Birdsell
Cooper
Coulondre
Denver
Duret
Frederico
Fryxell
Goetting-Minesky
Green
Haag-Liautard
Hardison
Hess
Hodgkinson
Hwang
Jeffreys
Kimura
Kondrashov
Laurent Duret
Ley
Lynch
Makova
Matassi
Nikolaev
Tian
Walser
Publication venue: Public Library of Science
Publication date: 01/01/2009
Field of study

Why are some genomic positions more mutable than others? The identification of cryptic mutation hotspots in the human genome indicates that the determinants of mutation rates are more complex than anticipated